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© In a recognition/diagnosis method, a relation between total sums of inputs of respective cases and teacher 
data is listed in the order of magnitude of the total sums of inputs. Based on the value of the teacher data for the 
input having the maximum total sum and the number of times of change of teacher data in the table are 
considered, a configuration of a neural network (the number of hidden layers and the number of neurons thereof) 
is determined. Coupling weights are analytically calculated based on the table. 
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BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION 

The present invention relates to relates to a recognition or diagnosis method primarily using a neural 
network for diagnosing the presence or absence of a fault in a current operation status by utilizing past 
operation information of various components of a power supply circuit such as a gas insulated switchgear 
(GIS) or a transformer. It is applicable to not only various engineering fields but also a medical field, and 
more generally to the recognition and diagnosis of physical, chemical and biological phenomena. 

DESCRIPTION OF THE RELATED ART 



In JP-A-2-297074, a diagnosis method by using a neural network is proposed. In the method, past 
status of various types of equipments are previously learnt by a neural network having a multi-layer 
75 structure. When new status of the equipments is inputted, whether the equipments are normal or not is 
determined based on the learn result. If they are determined to be abnormal, a cause therefor is inferred. 

In JP-A-2-272326, detailed description is made on mechanical vibration, acoustic vibration and electric 
oscillation of a rotary machine. 

In JP-A-2-1 00757, a learning method of a parallel neural network is proposed. It is stated therein that a 
20 back propagation method is becoming effective as a learning method. A conventional learning method has 
disadvantages in that the learning does not proceed once it falls in a local minimum and a precision of 
learning sharply decreases when the number of data to be learnt is large. In the proposed method, more 
than one neural networks are connected in parallel to serially learn in order to improve an efficiency and the 
precision of the learning. Other arts related to the present invention are disclosed in USP 5,023,045, JP-A-2- 
25 206721 , JP-A-4-1 20479, JP-A-3-201 079 and JP-A-3-092975. 

Fig. 1 shows a conventional neural network. Any number (one in the illustrated example) of hidden 

layers are provided between an input layer and an output layer. A set of input data X1.X2.X3 X n are 

inputted to the input layer. In Fig. 1, "1" is also always supplied for adjusting outputs from neurons. 
Products of those inputs and coupling weights W1, W 2 , W 3 , . . ., W n , W n+1 are inputs U1 to a neuron Ne. An 
30 output from the neuron is Vi . Ui and Vi are calculated by equations (1 ) and (2). 

35 



±— (2) 

40 

Similar equations are applied to the inputs and the outputs of other neurons of the hidden layers and 
the output layers. Only one Na of the neurons of each hidden layer is coupled with "1 " for adjusting the 
output of a neuron connected thereto which the output of the connected is equal to *1 w to provide a 

45 predetermined coupling weight. 

Thus, in the conventional neural network, an operation of learning is essential. The learning means that 
coupling weights among the neurons are determined through iterative calculations so that a calculated value 
approximated to data called teacher data which is predetermined for input data is outputted from the output 
layer when the input data is applied. For example, it is the learning to gradually modify the coupling weights 

50 through the iterative calculations, to reduce an error which is defined as 0.5 times of a square sum of 
differences between the teacher data and the calculated output. If the learning proceeds well, the error 
decreases, but in some cases the error does not decrease and the calculated output is not attained with a 
sufficient precision even after a large number of times of learning. Such a case is called that the neural 
network has fallen in a local minimum. In this case, it is not possible to get out of the local minimum 

55 whatever number of times of the learning is increased and appropriate coupling weights are not determined. 
Even if the calculated output converges to a desired value, a large number of times of learning requires 
a very long time (for example, several hours or more) depending on the scale of the neural network and the 
teacher data. 
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In the conventional neural network, the optimum number of hidden layers and the number of neurons 
are determined by a trial and error method. It is inefficient and the improvement thereof has been desired. 

SUMMARY OF THE INVENTION 

5 

ft is an object of the present invention to provide a method having a high generalization ability to solve 
the above problems. 

A method for analytically determining coupling weights of neurons from input data and teacher data has 
not been known but the inventors recently developed a method for analytically determining coupling 
io weights of neurons, which is utilized in the present invention. 

Where the coupling weight is to be determined by iterative learning, the above method for analytically 
determining the coupling weights may be utilized. For example, the number of hidden layers, the number of 
neurons thereof and what is made clear relating to the coupling weight are partially utilized. The following 
means is further utilized. 

75 (1) As will be described in detail hereinlater, characteristics of input data are examined from as wide 
range as possible, and learning data is classified into several classes. Then, the proposed method is 
applied to each of the classes. 

(2) A range of scatter of the data to be learnt is considered and design is made such that a correct 
solution is obtained if diagnosis data is within the range of scatter. 
20 (3) If a correct solution is determined after the diagnosis, the diagnosis data and the correct solution are 
made available as learning data and teacher data in the next run. 
(4) In case of diagnosis, the inference is set logically more strict. 

The present invention relates to a recognition or diagnosis method using a neural network having an 
input layer, an hidden layer and an output layer in which learning data inputted to the input layer are 

25 classified into a plurality of types in accordance with characteristics of the data, the learning data is inputted 
for each type, a total sum is determined for each case, corresponding teacher data and the total sum are 
associated in the order of magnitude for each output of the output layer, and coupling weights are 
determined based on the teacher data corresponding to the case having a maximum total sum in the above 
table, the total sums before and after the change of the teacher data and the teacher data. 

30 The present invention further relates to a recognition or diagnosis method using a neural network having 
an input layer, an hidden layer and an output layer in which input data is inputted for each value for each 
case based on an upper limit and a lower limit of the input data for each case inputted to the input layer, a 
total sum is determined for each case, corresponding teacher data and the total sum are associated in the 
order of magnitude for each output of the output layer, and the coupling weight is determined based on the 

35 teacher data corresponding to the case having a maximum total sum in the above table, the total sums 
before and after the change of the teacher data and the teacher data. 

The input data inputted to the input layer preferably has the same coupling weight multiplied to the 
upper limit and the lower limit. 

The present invention further relates to a recognition or diagnosis method using a neural network having 

40 an input layer, an hidden layer and an output layer in which a an input sum is determined by selecting 
coupling weights such that the input sum determined from a case other than a particular case inputted to 
the input layer does not fall between two input sums determined from an upper limit and a lower limit of the 
input data of the particular case, the input sum is inputted as the input data, an input total sum is 
determined for each case, corresponding teacher data and the input total sums are associated in the order 

45 of magnitude for each output of the output layer, and the coupling weight is determined based on the 
teacher data corresponding to the case having the maximum input total sum in the above table, the input 
total sums before and after the change of the teacher data and the teacher data. 

The teacher data of the particular case and the teacher data of the case other than the particular case 
are preferably different from each other. 

so Preferably, the coupling weight is selected such that the second hidden layer functions as an AND 
circuit and the output layer functions as an OR circuit when there are two hidden layers and the teacher 
data corresponding to the case having the maximum total sum is 0. When there are two hidden layers and 
the teacher data corresponding to the case having the maximum total sum is 1 , it is preferable to select the 
coupling weight such that the second hidden layer functions as an OR circuit and the output layer functions 

55 as an AND circuit. 

Further, the input data is preferably a reciprocal for each case so that the magnitude relation of the 
input data is reversed. 
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Preferably, a back propagation iterative learning method is used to calculate the coupling weights which 
are principal parts of the neural network. 

The present invention further relates to a recognition or diagnosis method using a neural network having 
an input layer, an hidden layer and an output layer in which learning data is inputted to the input layer for 
5 each case, a total sum is calculated for each case, corresponding teacher data and the total sum are 
associated in the order of magnitude for each output of the output layer, a coupling weight is determined 
based on the teacher data corresponding to the case having a maximum total sum in the above table, the 
total sums before and after the change of the teacher data and the teacher data, and the fact that the 
inputted diagnosis data is out of the range of scatter designated for the leaning data is displayed. 
10 Preferably, the teacher data and the total sums are listed in the order of magnitude to associate the 
teacher data with the total sums in the order of magnitude. 

The present invention further relates to a recognition or diagnosis method using a neural network having 
an input layer, an hidden layer and an output layer in which when diagnosis data is to be inputted and a 
correct solution to the diagnosis data is known, the diagnosis data and the corresponding correct solution 
is are transferred as a part of learning data after the diagnosis. 

The present invention further relates to a recognition or diagnosis method using a neural network having 
an input layer* an hidden layer and an output layer in which an output to diagnosis data is calculated after 
the diagnosis data has been inputted, at least one probable learning data is inferred from the output, the 
learning data is compared with the diagnosis data, and a result is displayed. 
20 Where the inferred learning data has a scatter, it is preferable to compare the diagnosis data with the 
learning data having the range of scatter and display a result. 

Preferably, the first coupling weight to be multiplied to the input data of the input layer is different from 
other coupling weights. 

Further, in accordance with the present invention, the diagnosis data and the learning data are directly 
25 compared, whether the diagnosis data and the learning data of a particular case totally match, partially 
match or totally mismatch is determined, a result is displayed, the diagnosis data and the learning data 
having a range of scatter are directly compared, whether the diagnosis data is totally in the range, partially 
in the range or totally not in the range is determined, and a result is displayed. 

Further, in accordance with the present invention, simultaneous equations are prepared by using a 
30 calculation formula for the input to the neuron based on an upper limit and a lower limit of the input data 
having a range of scatter, the simultaneous equations are solved to determine a coefficient, and the neural 
network is utilized based on the coupling weight. 

Further, the present invention relates to a recognition or diagnosis method using a neural network 
having an input layer, an hidden layer and an output layer weighted by coupling weights which has a 
35 capability to evaluate teacher data for determining the coupling weights. 

Further, in accordance with the present invention, a system has an electrical circuit to be used under a 
plurality of conditions and the circuit is diagnosed by using a neural network. The plurality of conditions are 
. taken as learning data to determine coupling weights of the neural network. 

The present invention is now summarized. A threshold modifying neuron which always outputs "1" is 
40 provided for each layer except the output layer, a total sum of inputs is calculated for each case, total sums 
are listed together with corresponding teacher data in the order of magnitude of the total sum, neurons 
equal in number to the number of times of change of the teacher data are sequentially arranged as a first 
hidden layer based on the number N of times of change of the teacher data in the list, and where N is odd, 
(N-1)/2 neurons are arranged as a second hidden layer, outputs of the first hidden layer are sequentially 
45 inputted to the second hidden layer two at a time, outputs thereof are inputted to the output layer together 
with the reaming one output of the first hidden layer. Where N is even, N/2 neurons are arranged, the 
outputs of the first hidden layer are sequentially inputted two at a time, the outputs thereof are inputted to 
the output layer, and the coupling weights are analytically determined based on the teacher data 
corresponding the case having a maximum input sum of the table and the input sums before and after the 
50 change of the teacher data and the teacher data. 

Preferably, any numbers K| which are not all equal, for example, a random number are multiplied to the 
inputs before summation instead of simple summation of the inputs for each case. 

Further, it is preferable to provide one threshold modifying neuron which always outputs "1" for each 
layer except the output layer and a total sum of products of the inputs and appropriate coupling weights is 
55 determined for each case. 

Further, input sums which are products of an upper limit and a lower limit of the input and an 
appropriate same coupling weight are prepared for the input data of each case based on the upper limit and 
the lower limit of the scatter of the input data of each case, and the coupling weights are selected such that 
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the input sum determined from other cases does not fall between two input sums determined based on the 
upper limit and the lower limit of the inputs of a particular case. Namely, the weights are determined such 
that the sum of other case does not fall in the range of the first determined case. 

Further, input sums which are products of an upper limit and a lower limit of the input and an 
5 appropriate same coupling weight are prepared for the input data of each case based on the upper limit and 
the lower limit of the scatter of the input data of each case, and the coupling weights are selected such that 
the input sum determined from other cases having teacher data other than the teacher data of a particular 
case does not fall between two input sums determined based on the upper limit and the lower limit of the 
inputs of the particular case. 

io Further, the fact of diagnosis is preferably indicated by any means such as a display screen of video 
display means or by alarm sound. 

The diagnosis data and the learning data are preferably compared for total match, partial match or total 
mismatch, and a function to classify the result is preferably provided. 

A matrix is preferably used as means for comparing data numbers of the diagnosis data and the 
75 learning data. 

(1) By the use of the analytical approach, the iterative learning required in the prior art is not necessary. 
An optimum number of hidden layers of the neural network and the number of neurons can be 
predetermined. No problem of local minimum arises. Accordingly a time from the input of data to the 
diagnosis by using the neural network can be reduced. Since the present method may be applicable 

20 irrespective of the number of input data or the number of cases, the coupling weights among the 
neurons in a large scale neural network can be exactly determined in a short time. 

(2) By using the iterative learning together with the analytical approach, the significant reduction of the 
learning time is attained. 

(3) Since the number of times of calculation is significantly reduced. The recognition or diagnosis may 
25 be implemented in an economical notebook type personal computer depending on a memory capacity 

thereof. 

(4) Based on data of a particular form after the classification, the number of cases to be compared for 
the similarity is smaller after the classification and the comparison is facilitated and the precision of 
diagnosis is naturally improved. 

30 (5) Since data in the appropriate range of scatter is accepted as a correct solution, the precision of 
diagnosis to unlearned data is naturally improved. 

(6) When a result of diagnosis is prepared and if it is a correct solution, the diagnosis data and the 
correct solution are taken as the learning data for the next run. Accordingly, the precision of diagnosis is 
naturally improved gradually. 
35 (7) By effecting the diagnosis logically more strict, the precision of diagnosis is further improved. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows a configuration of a conventional neural network; 
40 Fig. 2 shows a configuration of the neural network in accordance with an embodiment of the present 
invention; 

Fig. 3 shows a flow chart of an operation of a network configuring unit shown in Fig. 2; 

Fig. 4 shows a relation between sums of input data in the respective cases and teacher data in which the 

teacher data changes once; 

45 Fig. 5 shows a configuration of the neural network of the present invention when the teacher data 
changes once; 

Fig. 6 shows a relation between total sums of input data in the respective cases and the teacher data in 
which the teacher data changes twice and a merging method therefor; 

Fig. 7 shows a configuration of the neural network of the present invention when the teacher data 
so changes twice; 

Fig. 8 shows a relation between total sums of input data in the respective cases and the teacher data in 
which the teacher data changes three times and a merging method therefor; 

Fig. 9 shows a configuration of the neural network of the present invention when the teacher data 
changes three times; 

55 Fig. 10 shows other relation between total sums of input data in the respective cases and teacher data in 
which the teacher data changes once; 

Fig. 11 shows other relation between total sums of input data in the respective cases and the teacher 
data in which the teacher data changes twice and a merging method therefor; 
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Fig. 12 shows other relation between total sums of input data in the respective cases and the teacher 
data in which the teacher data changes three times and a merging method therefor; 
Fig. 13 shows conversion between values being operated and the input and output; 
Figs. 14A to 141 show classification of learning data; 
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1) in which some data is used by reciprocals; 



1) of combination for teacher data which changed three times; 



DESCRIPTION OF THE PREFERRED EMBODIMENTS 



25 



30 



A diagnosis/control neural network system 1 of the present invention is now explained in detail with 
reference to the accompanying drawings. 

First, a method for configuring the neural network of the present invention is now explained with 
reference to Fig. 3. 

In a step S202, the sample data sets and teacher data set for the respective cases are inputted. The 
sample data of each set for the case Xj (i = 1 to n) is any value of 03X^1 , and the teacher data for the case 
is either "0 W or "1". It has been found out that coupling weights Wi, W 2 , W 3 , . . W n may be assumed to 
be equal to each other as shown by the following equation (3) in most cases although there is a very rare 
exception (which will be discussed hereinlater). 



= W 2 = W 3 



(3) 



35 A weight W n+1 for a threshold modification input "1 " usually assumes another value. 

Taking the equation (3) into account, in the equation (1 ), a sum EX, of the input sample data X» is first 
calculated and then a coupling weight may be multiplied thereto. In a step S204, a sum of the inputs is 
calculated for each data set corresponding to each case. In a step S206, they, are rearranged in an order of 
the total sum EX| and they are displayed together with the corresponding teacher data. 

40 

Table 1 



50 



Case Number 




Teacher Data 


1' 


Si 


0 


2' 


s 2 


1 


3' 












n' 







55 After the rearrangement, the number of neurons is determined in a step S208, and the coupling weights 
to the neurons are calculated in a step S210. Then, they are set in the neural network unit in a step S212. It 
has been found thorough various considerations that the treatment in determining the coupling weights 
should be changed depending on whether the teacher data to the maximum total sum (which is designated 
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by Si) is "0" or "1". First, it is assumed that the teacher data for Si is n 0 n . 
For the convenience of explanation, a simple case will be discussed first. 

In Fig. 4, an ordinate represents Si, S 2 . S 3 S n of Table 1 and an abscissa represents "0" and "1" 

of the teacher data. In Fig. 4, the teacher data changes from "0" to Tata boundary of Si and S2. (It may 
5 change at other point). In the case of such a simple change, a hidden layer of the neural network is not 
necessary at all and the network can be configured simply as shown in Fig. 5. Coupling weights W a and W b 
of the neural network are determined in the following manner. 

Looking at the change of the teacher data in Table 1 , the sums of the inputs immediately before and 
after the boundary at which the teacher data changes as shown in Table 1, that is, Si and S2 in the present 
10 example, are used to form the following linear simultaneous equations (4) and (5) with two unknowns. 

Si -W a + LW b = -10 (4) 

S2*W a + LW b = 10 (5) 

75 

The left terms represent inputs in the cases V and 2\ and the right terms are the values of the inputs 
with which the output calculated by the formula (2) is practically "0" or "1" (Where Ui = -10, 1/(1 + exp(-(10)- 
))*0, and where U, = 10, 1/(exp(-(10)))«1 .). 

Assuming Si-S2*0, the above simultaneous equations are solved. 

20 

W a = -20/(Si-S 2 ) (6) 

W b = 10 + 20S2/(Si-S 2 ) (7) 

25 Since Si>S2, W a is negative and W b is positive. If S' is smaller than S2, the following inequality is met. 

S'-W a + LW b > 10 (8) 

Thus, the outputs of S3 et seq in Fig. 4 are closer to "1" than S2 and they can be practically assumed 
30 to be "1". It is seen that a set of simultaneous equations as shown by (4) and (5) are for one change in the 
teacher data. 

As seen from Fig. 5, the coupling weights W a and W b are uniquely determined as the solutions of the 
simultaneous equations. £X, is a variable but "1" is a fixed value. Where a sigmoid function is used as an 
output function of the neuron, a portion relating to "1" may be deleted from the circuit of Fig. 5 if a sigmoid 

35 function having an offset of (1 • W b ) along the abscissa is used. 

Let's consider a case where the teacher data starts from "0", goes to "1 " and returns to "0" again as 
shown by a segment 50 of Fig. 6. The teacher data changes two times. Since segments 51 and 52 can be 
generated in substantially same manner as that in Fig. 4, they are combined to generate the segment 50. 
Since the output is "1" only when both of the segments 51 and 52 are T, it is seen that they may be 

40 connected in an AND manner. The output of the AND circuit is "1" when both inputs are "1" so that a 
segment 53 is generated, which matches to the segment 50. A neural network for this case is shown in Fig. 
7. Coupling weights of the input layer and the hidden layer are derived by solving simultaneous equations 
similar to the equations (4) and (5) at the two changes of the teacher data. In this case, consideration is 
made for the case where the teacher data changes from "0" to "1 and the case where it subsequently 

45 changes from "1 " to "0". The neuron shown by hatching at the bottom of the hidden layer is for threshold 
modification. When "20" is designated as the coupling weight between the input layer and the hidden layer 
as shown, the output is always practically "1". The coupling weights between the hidden layer and the 
output layer in an AND manner can de determined in a similar simple way. For example, they may be "20", 
"20" and "-30" as shown in Fig. 7. 20 + 20-30= +10 and 20-30 = -10, which correspond to " + 1" and "0", 

50 respectively. As seen, "-30" is a bias unit. An OR and (-10) shown in Fig. 7 will be explained hereinlater. 

Let's consider a case where the teacher data changes three times as shown by a segment 70 of Fig. 8. 
Since segments 71 and 72 are generated in a similar manner to that described above, a segment 73 may 
be generated by AN Ding them. A segment 74 is generated in a similar manner to that for the segment 71 
and the segments 73 and 74 are coupled in an OR manner which outputs "1" when at least one of the 

55 inputs thereto is "1 ". In this manner, a segment 75 is generated. It matches to the segment 70. 

A neural network therefor is shown in Fig. 9. Again, the neuron shown by hatching at the bottom of the 
hidden layer, is a bias unit and it always outputs "1 ". Third output from the top of the first hidden layer is 
directly connected to the output layer without routing the AND operation neuron. The coupling weights of 
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the OR operation neuron can be determined in a similar manner to that described above. They may be 
"20", "20" and "-10" as shown in Fig. 9 so that "1" is outputted when at least one of the inputs to the 
output layer is "1 ". (-10) is a bias unit. (OR) and (-10) of the hidden layer and (AND) and (-30) of the output 
layer shown in Fig. 9 will be explained herein later. 
5 Similar analysis is conducted while the number of times of change of the teacher data is incremented to 
derive Table 2. 

Table 2 



The number of times of change of teacher data 
and the number of neurons of hidden layer 



75 


Number (n x ) of times 
of change of teacher 
data 


1 


2 


3 


4 


5 


6 


7 


Hi 




Number (n 2 ) of neurons 
of first hidden layer 


(1) 


2 


3 


4 


5 


6 


7 


Hi 


20 


Number (n 3 ) of AND 
Alteration neurons of 
second hidden layer 


0 


(1) 


1 


2 


2 


3 


3 


*i 


25 


Number (n 4 ) of OR 
operation neurons of 
output layer 


0 


0 


1 


1 


1 


1 


1 


1 



30 

Table 2 shows a relation between the number of times of change of the teacher data and the number of 
neurons of the hidden layer. The number n2 of neurons of the first hidden layer, the number n 3 of AND 
operation neurons (circuits) which are the second hidden layer, and the number n* of OR operation neurons 
(circuits) which are the output layer are shown for the numbers "1 " to "7" of times of change of the teacher 

35 data and a general number ni (The neuron for modifying the threshold is separately provided for the hidden 
layer.). As shown in Table 2, the number of neurons in the first hidden layer is equal to the number ni of 
times of change of the teacher data. Where the number ni of times of change of the teacher data is "1 ", 
the neuron of the first hidden layer are the neuron of the output layer itself, and where the number ni of 
times of change of the teacher data is "2", the neurons of the second hidden layer are the neurons of the 

40 output layer. It is convenient to consider that they are the neurons of the hidden layer for the number "1" in 
the parentheses of Table 2 than to express it as no hidden layer, when a method for calculating the 
coupling weight is to be considered. It has been proved by the analysis that it is sufficient to consider two 
hidden layers at most. As described above, when there are a plurality of outputs from the output layer, the 
above calculation method is repeated. 

45 More generally, where the number of times of change of the teacher data is relatively large, the 
configuration of the neural network of the present invention is one shown in Fig. 2. Where the number of 
times of change of the teacher data is odd, there is a direct connection from the first hidden layer to the 
output layer without routing the second hidden layer. The coupling weights are determined in the manner 
described above. 

so Let's consider a case where the teacher data corresponding to the maximum sum Si of the inputs is 
"1 " as shown in Table 3. 



55 
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Table 3 



Case Number 




Teacher Data 


1* 


Si 


1 


2' 


s 2 


0 


3' 


s 3 










n' 







75 

Fig. 10 shows a case where the teacher data changes once. In Fig. 10, the teacher data changes 
between Si and S2 although it may change at other point. The neural network in this case requires no 
hidden layer and it may be the same one as that shown in Fig. 5 but the coupling weights are different, 
which are determined by simultaneous equations similar to the equations (4) and (5) but the sign of the right 

20 term is changed to render the outputs to be "1 " and "0". 

Fig. 11 shows a case where the teacher data changes twice as shown by a segment 100. Since 
segments 101 and 102 are generated in the same manner as that described above, the segment 100 may 
be generated based thereon. When at least one of the segments 101 and 102 is "1", the output is to be 
"1". That is, they are coupled in an OR manner. In this case, the neural network is substantially identical to 

25 that of Fig. 7 but the output layer functions as the OR operation instead of the AND operation and the 
coupling weight for the threshold modification is changed from --30" to "-10", for example. 

Fig. 12 shows a case where the teacher data changes three times as shown by a segment 110. In this 
case, segments 111 and 112 are generated in the same manner as that described above, and they are 
ORed to generate a segment 113. Then, a segment 114 is generated in the same manner as that for the 

30 segment 111. Finally, the segment 113 and the segment 114 are coupled in an AND manner to generate a 
segment 115. It matches to the segment 110. In this case, the neural network is substantially identical to 
that of Fig. 9 but the OR operation neuron is arranged in the second hidden layer and the coupling weight 
for modifying the threshold is changed so that the output layer functions as the AND operation. The result 
of the analysis while incrementing the number of time of change of the teacher data is summarized in Table 

35 4. 



Table 4 



40 


Number (n 5 ) of times 
of change of teacher 
data 


1 


2 


3 


4 


5 


6 


7 


n 5 




Number (n 6 ) of neurons 
in first hidden layer 


(1) 


2 


3 


4 


5 


6 


7 




45 


Number (n 7 ) if OR 
operation neurons of 
second hidden layer 


0 


(1) 


1 


2 


2 


3 


3 




50 


Number (n 8 ) of AND 
operation neurons of 
output layer 


0 


0 


1 


1 


1 


1 


1 


1 



* a :fl,--|5-i<l-<-l>«*) 



Table 4 shows the number n 6 of the neurons of the first hidden layer, the number n 7 of OR operation 
neurons as the number of neurons of the second hidden layer, and the number ns of the AND operation 
neurons in the output layer, for the numbers "1" to "7" of times of change of the teacher data and a 
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w 



15 



20 



qeneral number r* when the teacher data of S, is "1". Like in Table 2. the neuron for ™d.fy.ngjhe 
Teshold is separate., provided for the hidden layer and the number "1" in the parentheses also represent 
me outpJs oTthe output layer. By comparing Table 2 and Tab.e 4. it is seen that the AND and OR are 
cimnlv reDlaced In this case, two hidden layers at most may be considered. 

P l e P ?enta"y where the number of times of change of the teacher data is relatively large the 
conftou ation of the neural network is like that shown in F,g. 2. When the number of times , of change of toe 
teaS data is odd there is a case where direct connection is made from the first hidden layer to the 
routing the second hidden layer as shown by a broken line. The second hidden layer 
Sdes the OR operation neurons and the output layer includes the AND operation neurons. The method 
fnr ralculatina the coupling weights is same as that described above. 

The re a p uraS of outputs are required for the output layer, the analysis is repeated for each output m 
the o^ess described above. When the outputs are to be determined, a portion of the neural network may 
be shaded «?he boundary of the change of the teacher data shown in Table 1 and Table 2 are same. Thus, 
the entire configuration may be simplified by utilizing th.s 'act. nnliatinns are 28 ro 

In the equations (6) and (7), when S, and Sh are equal, the denominators of those equations are zero. 
,n sLch a case, instead of simply summing the inputs, a number K, which * determine^ I from a random 
number and which is not "1" is multiplied to each input before the summation. Namely. EK* are calculated 

and toen nanXd in the same manner as S,. S 2 . S, S n of Table 1 and Table 3. By doing so 

riSTSs^Sat the denominators of the equations (6) and (7) are zero, except when the inputs o the 
17Z oT casel are tota.ly equal. If the denominators of the equations (6) and (7) are sh.l zero a prccess 
o deriving another random number may be repeated. As is apparent from the previous descnption the fart 
°hat the denominator is zero causes a problem only when S, and S, are equal and the oorresponding 
eacher data are different Even if S, and & are equal, no problem is raised ,f the corresponding teacher 
0?!?^ ^^^ ^ are not used for the calculation of the coupling weights. Accordingly more 
2S generXTtable 2 and Table 4. EX, is not calculated as a simple summation but ,s calculated by 
using the number K, determined from the random number and the calculation may be proceeds I in toe 
manne described above. In this case, the coupling weight for each input X may be con.dered to be 
When K| is used as the weight factor, it can be readily attained by adding a neuron to toe input layen I Ms 
Tssumed that S,' and S 3 ' are equal, the corresponding teacher data are equal, and here is a sum & 
™ whichTs between S,' and S 3 ' and has the teacher data equal to the teacher data for S,' and S 3 . In such a 
caTe as fsTpparen from the above equations, there is no change among the three teacher data and hence 
S?!s no used tor he Calculation of the coupling weight. Accordingly, it is seen that the data S." does not 
exis as the dlnosis data and it is not worthwhile adding. It may be deleted to save the memory space If 
mTnew.raddenacher data changes the boundary of the change of the teacher data, it ,s worthwhile 

^T^accordance with the present method, the following advantages are offered: 
m The iterative leaning required in the prior art is not necessary. 

2 Snce C^urTr of hidden layers, the number of neurons required therefor and the coupling weights 
can be analytically determined, highly efficient diagnosis is attained in a short time. 
40 (3) It has been proved that the number of hidden layers may be two at maximum 

(4) Since the coupling weights are analytically determined in a simple manner. ,t is apphcable to a large 
scale svstem having a number of input data and output data. 

(5) Since toe iterative learning required in the prior art is not necessary, more economical computer may 

be used for the recognition and diagnosis. ^„„„ cic » ran he determined 

<6> When the input data is to be added to enhance the precision of diagnosis, it can be determinea 
Setoer ?t ^worthwhile adding or not based on the input data and the teacher data and if it is not worth, 
thft addition is stooped. Accordingly, the memory can be efficiently used, 
to toe equations (4^ and (5). the output "0" or "1" is approximated from the neuron. In a conversion 
shown Tn Rg U Considering \ a positive number close to zero. < and (1-0 are used for the -ntermediate 
so aSl instead of "0" and "1 and in a fine, output stage it ■■ ™J ~ 

"0" to "1 ". When this is applied to the right terms of the equations (4) and (5), they are expressed by 

Si«W a + 1«W„ = log( e /(1-0) (9) 
ss S2.W n + 1»W b = -log(«/(1-<)) (10) 

where log is natural logarithm. In the method in which the output of the equation (2) is treated as ; "0" J j>r "1" 
for praSca! reason, approximation calculation is partially included and hence an error ,s introduced. The 
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15 



20 



25 



present method includes no such error. Pine 14A to 141 which 

A further embodiment of the present invention is expired with reference to F *« m ^ A ^V" ~ 
illustrate the classification of the learning data. An abscissa represents an input data number .of the ^same 
Sse and an ordinate represents a magnitude of corresponding input ** x * num^r of e amp, s a e 
classified based on characteristics thereof as shown in Figs. 14A to 141. In Figs. 14A to 140. mere i is o 
tS and they may be classified in detai. based on the magnitude of peak, the value n for the peak and a 
££■ gradient (convex upwardly or concave downwardly) although it is n °V^ftldTah peaks ie 
mere are two peaks and they may be classified based on the magnitudes of the left and r gh peaks the 
Situder 0 Uhe ^aks and'va.leys, and the va.ue n for the peak or valley (whether the n or the vaHey , 
Sr to the n for the left peak or the n for the right peak) although .t .s not shown. In Figs. 14G to VM. 

L three oeaks and they may be classified from various aspects as described above The 
T S «ZT:jT«T™:: b J<> on shapes such as rectangular and pedestal although ., ,s not 

Sh ° The entire input data for learning is classified in accordance with Figs. 14A to 14. and a confi9"retio" of 
a muW-lwer neural network and coupling weights among the neurons may be analytically determined Un 
accolS wUh me above method. In the above analytical method, the configuration - ^J^ 
and the coupling weights among the neurons are normally determined .n several to several tens seconds 
altnouohTdepends on the scale of data. Since the number of cases belonging to a particular class is 
sS than that when no classification is made, the comparison is facilitated in the d.agnosis , and the 
domination of confusing and similar data is relatively easy, and the precision of diagnoses .s improve* 
ST!,*! Lrning method which requires a number of times of learning by the prior art back 
^ g «^S is leant after such classification, the time ^^ZTl^^ 
Increases in many times. Accordingly, it is generally difficult to apply the present method to back 
propSation .eS method after the classification of the input data. However, depending on t he input data. 
STy £ utilized for the back propagation learning method to reduce the overall learning tme 

A further embodiment of the present invention is explained with reference to Figs. 15 and I 16. In the 
preirl ^Ib^Z anafyticaLethod explained in Figs. 2 to 13 is expanded with the cons.deration of 

^ In C Rg T^TscfsSrepresents an input data number n and an ordinate represents a value xo. the 
in p ut a 'hey do in Figs. 14A to 14.. Two measured curves F and G with scatter are shown^An upper hmrt 
of Fte represented by F u . a lower limit by F d . an upper limit of G by G u and a lower limit by G d . 

in Z explanation of the present invention, it is assumed *a, the input x ,s any 
number of output (teacher data) of the neural network is Land the value thereof ,s 0 or 1 . Where there 

- rJ2r£^«™ — - — G is il outputs 

" 1 " Total sums of inputs Sf u . Sf d , Sg u and Sg d excluding the threshold modifying input with arbitrarily 
determined weights (coupling weights) K, are determined by formula (1 1 ) as they 



50 



55 



i-l 



(11) 



are in Table 1 . 
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Table 5 



10 



Input Total Sums and Teacher Data Rearranged in the Order of Magnitude 


Case No. 


Total Input Sum 


Teacher Data 


Remark 


r 


Sf u 


0 


Substitute by Si 


2' 


Sf d 


0 


Substitute by S2 


3 f 


Sg u 


1 


Substitute by S3 


4' 


Sg d 


1 


Substitute by S* 



75 



20 



As shown in Table 5, the data are rearranged in the order of magnitude and they are displayed together 
with the corresponding teacher data. After such rearrangement, it is necessary to change the weights 
(coupling weights) of the second hidden layer et seq of the neural network depending on whether the 
teacher data having the maximum input total sum is "0" or "1 For the input F and G having a scatter, in 
order to get a correct solution as an output of the neural network if it is within the range of scatter, it is 
necessary that either a formula (12) or a formula (13) is met when the weight (coupling weight) to the input 
is positive or "0". 

Sf u * Sf d 2i Sg u £ Sg d (12) 



Sg u * Sg d * Sf u * Sf d (13) 



30 



35 



40 



45 



50 



55 



Depending on the manner of change of a curve, the formula (12) or (13) may be met even if all weights 
(coupling weights) to the inputs x are selected equal. However, depending on the curve, the value of other 
curve (for example, Sg u ) may fall between sums of one curve (for example, Sf u and Sf d ). In order to assure 
that the magnitude relation of the formula (12) or (13) is positively met, the following methodology is used. 
First, as candidates for the weights (coupling weights) Kj to be used, 0.0001, 0.001, 0.01, 0.1, 1.0, 10.0, 
100.0, 1000.0, 10000.0 etc. are prepared, and the right side of the crosspoint of the curves F and G of Fig. 
15 is weighted with very large weights (coupling weights) K| compared to the left side (for example, 10 to 
100 times) so that the total sums of inputs multiplied by the weights (coupling weights) are emphasized on 
the right side of the crosspoint of the curves F and G and the magnitude relation of the formula (12) in 
which the sum for the curve F is larger is met. 

On the other hand, if the left side of the crosspoint of the curves of Fig. 1 5 is weighted with very large 
weights (coupling weights) Kj compared to the right side (for example, 10 to 100 times), the total sums of 
the inputs multiplied by the weights (coupling weights) are emphasized on the left side of the crosspoint of 
the curves F and G so that the magnitude relation of the formula (1 3) in which the sum of the curve G is 
larger is met. 

A magnitude relation should not be such that a total sum determined by other curve falls between two 
total sums determined by one curve. 

When the weight (coupling weight) is positive or n 0 n for the data in the range of scatter shown for the 
curve F, the sum of the products of the inputs x and the weights (coupling weights) Kj is always between 
Sf„ and Sd d . Similarly, when the weight (coupling weight) is positive or "0" for the data in the range of 
scatter shown for the curve G, the sum of the products of the inputs x and the weights (coupling weights) K| 
is always between Sg u and Sg d . Accordingly, as will be explained later, by selecting the weights (coupling 
weights) to meet the formula (12) or (13), the same output as that designated by the curves F g and F d is 
attained for the input x in the range of scatter of the curve F. Similarly, the same output as that designated 
by the curves G u and G d is attained for the input x in the range of scatter of the curve G. 

For the purpose of simplification, it is assumed that the formula (12) is met, the output (teacher data) is 
n 0 n for the curve F and the output (teacher data) is "1 " for the curve G. 

In order to make it easy to process the formulas of the Table 5, the characters are substituted as 
follows. 

Sf u -»Si , Sf d ~»S2, Sg u -*S3, Sg d -»S* 



In Fig. 16, an ordinate represents Si, S2, Sa and S4 of Table 5 and an abscissa represents the teacher 
data. The teacher data changes only once at the boundary of S2 and S3. In order to generate this simple 
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change, no hidden layer is required in the neural network and a simple construction shown in Fig. 17 is 
sufficient. This is different from Fig. 5 in that the inputs are weighted. The coupling weights W a and W b of 
the neural network are determined as follows. 

Based on the change of the teacher data of Table 5, the following simultaneous equations are prepared 
5 by using the input total sums S2 and S3 immediately before and after the boundary of change of the 
teacher data shown by broken lines in the table while referencing to Fig. 17. 

S2-W a +1.W b = -10 (14) 

10 S 3 -W a + 1.W b = 10 (15) 

The value of the left term is the input to the output layer, and the value of the right term is an input 
value which allows that the output calculated by the formulas (1) and (2) be practically considered as n 0 tt or 

fl r. 

75 (Because 1/(1 + exp(-(-1 0))M), 1/(1 + exp(-(10)))»1, see formulas (1) and (2)) 
By solving the above simultaneous equations (S2-S 3 *0), 

W a = -20/(S 2 -S 3 ) (16) 

20 W b =10 + 20S 3 /(S2-S 3 ) (17) 

Since W a is negative and W b is positive, the following inequality is met if S' is smaller than S 3 . 

S'.W a + 1.VvV>10 (18) 

25 

Accordingly, the output to the input total sum which is smaller than S 3 of Fig. 16 is closer to "1" than 
that for S 3 and it may be practically considered as T. 

When S" is larger than S2, the following inequality is met. 

30 S"»W a +1.W b <-10 (19) 

Accordingly, the output to the total input sum which is larger than S2 of Fig. 17 is closer to "0" than 
that for S2 and it may be practically considered as "0". 

Thus, for the input in the range of scatter of the curve F or G of Fig. 15, "0" or "1" output is produced. 

35 Fig. 18 shows an example in which the curves F and G with scatter overlap in most areas. In such a 
case, if the weights (coupling weights) K, for the inputs are equal, the total sums of the inputs weighted by 
the coupling weights are almost equal depending on the shapes of the curves and the magnitude relation of 
the input total sums may not meet the above required relation. 

However, if the weight (coupling weight) Kj at one point in the non-overlapping area of the curves F and 

ao G is set sufficiently larger than the weights (coupling weights) Kj of other area (for example, 10 to 100 
times), the affect of the value of that area appears prominent so that the magnitude relation of the formula 
(12) or (13) is met. For example, if only the weight (coupling weight) Kj in the range A of Fig. 18 is selected 
sufficiently large, the sum for the curve H becomes dominant and the formula (12) is met, and if only the 
coupling weight in the range B is selected sufficiently large, the sum for the curve G becomes dominant so 

45 that the formula (13) is met. If the weight Kj in the range A is selected larger than those of other area and 
the weight K { in the range B is selected smaller than those of other area, the curve F is more dominant. The 
opposite selection may be made. 

Fig. 19 shows another approach for a similar case. In order to simplify the expression, the range of 
scatter is omitted in the drawing. In Fig. 19, again the abscissa represents the data number n and the 

50 ordinate represents the input data x. Two type of data are shown, one by circle and the other by X. Most 
data are overlapped but three data are not coincide to each other. Two data marked with X are smaller than 
the data marked with circle and one data marked with X is larger than the data marked with circle. In this 
case, if the scatter not shown is considered, the total sums of the products of the input data x and the 
weights (coupling factors) Ki of the respective cases are almost equal if the weights (coupling factors) K, are 

55 simply made equal and the magnitude relation of the formula (12) or (13) may not be met. The approach to 
partially increase the coupling weights to the input data has been described above. In Figs. 19-21, another 
method for meeting the magnitude relation of the formula (12) or (13) even if all coupling weights to the 
input data are equal is shown. 
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In Rg. 20, the data marked with X which is larger than the data marked with circle is not used as it is 
but the data marked with circle and X for that data number n are used by reciprocals. The reciprocal of the 
data marked with circle is represented by dotted circle. The reciprocal of the data marked with X is 
represented by A. In the case classified by the mark X, two data marked with X and one data marked with 

5 A are smaller than the data marked with circle (including dotted circle) of the same data number. 
Accordingly, even if all weights (coupling weights) K| are selected equal, the total sum of the input marked 
with circle ia larger and it can be discriminated. 

In Rg. 21 , the only one data marked with X which is larger than the data marked with circle is left as it 
is, and the two inputs for the n marked with X which are smaller than the data marked with circle as well as 

10 the data marked with circle are used by reciprocals. As a result, two data marked with A and one data 
marked with X are larger than the data marked with circle of the same data number. Accordingly, even if all 
weights (coupling weights) K| are selected equal, the total sum of the inputs marked with circle is smaller 
and it can be discriminated. 

Rg. 22 shows four cases having scatters. In this case, a range of the data numbers such as a range A 

J5 in which the data of the four cases assume different values from each other is selected, and the weights 
(coupling weights) K, for the input data in that range are selected sufficiently larger than those of other 
range. Thus, the total sums of the inputs weighted by the weights (coupling weights) Kj are larger in the 
order of F, G, H and I, and the sum of the upper limits of the scatter of the data is smaller than the lower 
limit and the order of the total sums of the four cases is not disordered. 

20 Assuming that the output (teacher) data for the input data F, &, H and I are "0\ "1 "0" and "1", the 
relation of the input total sums and the teacher data in the order of magnitude of the input total sum is 
shown in Table 6. The symbols for the input total sums are shown by Sf u , Sf d , Sgu, Sg d , Shy, Sh d , Si u , Si d 
with the suffixes u and d representing the upper limit and the power limit, respectively, as they do in the 
previous example. This relation may be represented by a segment graph as shown in Fig. 23. 

25 As shown by a segment 220, the teacher data starts from 0, passes through 1 and returns again to 0 
and finally becomes 1. the teacher data changes three times in this example. Segments 221 and 222 are 
generated in substantially same manner as that of Fig. 4 and they are combined to generate a segment 
223. Since the output may be "1 " only when both of the segments 221 and 222 are "1", it is seen that they 
may be connected to an AND circuit. A segment 224 is generated in the same manner as that for the 

30 segment 221 . Finally, the segments 223 and 224 are combined by an OR circuit. Since the output of the 
OR circuit is "1" when at least one of the inputs is "1", a segment 225 is generated. It coincides with the 
segment 220. 

If the output (teacher) data for the input data F, G, H and I are reversed to "1", "0", "1" and "0", 
respectively, the relation between the input total sum and the teacher data is that shown in Table 7. This 

35 relation may be represented by a segment graph shown in Fig. 24. As shown by a segment 230, the 
teacher data starts from "1", passes through "0", returns again to "1" and finally becomes "0". The teacher 
data changes three times in this example. Since segments 231 and 232 are generated in the same manner 
as that of Fig. 4, the segments 231 and 232 are connected to an OR circuit to synthesize a segment 233. 
Then, a segment 234 is generated in the same manner as that for the segment 231 . Finally, the segments 

40 233 and 234 are combined in an AND circuit to generate a segment 235. It coincides with the segment 230. 
In the above example, if the total sum of the products of the inputs and the weights (coupling weights) 
Ki is between Sf u and Sf d , between Sg 0 and Sg d , between Sh u and Sh d , or between Si„ and Si d , the output 
(teacher) data is designated. However, the output data is not designated when the total sum is between 
different curves, that is, between Sf d and Sg u , between Sg d and Sh u , or between Sh d and Si u . In this case, 

45 the following process may be effected. 

(a) Values of the both boundaries are outputted in accordance with a given formula. 

(b) "? n is added in the output display to indicate non-inference. 

As an example of (a), a value determined linearly or by a sigmoid function between n 0 n and "1 w is 
outputted for a non-designated range between the output designations "0" and "1" in accordance with a 
so deviation from the boundary. 

Rg. 25 shows a configuration of the neural network corresponding to Rgs. 23 and 24. In a bottom line 
of the hidden layer, a neuron shown by hatching which always outputs "1" for modifying a threshold is 
coupled, and the input of the first hidden layer is coupled to only the threshold modifying input "1". 

When the total sum of the inputs weighted by the weights (coupling factors) Ki corresponds to the 
55 teacher data n 0 n as shown in Rg. 23, it is requires that the second hidden layer functions as the AND 
circuit and the output layer functions as the OR circuit. On the other hand, when the maximum total sum if 
the inputs weighted by the weights (coupling weights) Kj corresponds to the teacher data "1" as shown in 
Rg. 24, it is required that the second hidden layer functions as the OR circuit and the output layer functions 
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as the AND circuit. 

As seen from the those figures, only the coupling weight for modifying the threshold changes between 
the AND circuit and the OR circuit. An equivalent coupling weight to the input x is a product of the weight 
(coupling weight) K| used in the formula (11) and the coupling weights W a and W b determined by the 
5 formulas (16) and (17). 

Table 6 



Four input cases in which teacher data starts from 0 


Case No. 


Input Total Sum 


Teacher Data 


r 


Sf u 


0 




Sf d 


0 


2* 


Sg u 


1 




Sg d 


1 


3' 


Sh u 


0 




Sh d 


0 


4' 


Si u 


1 




Si d 


1 


Table 7 


Four input cases in which teacher data starts from 1 


Case No. 


Input Total Sum 


Teacher Data 


V 


Sf u 


1 




Sf d 


1 


2' 


Sg u 


0 




Sg d 


0 


3' 


Sh u 


1 




Sh d 


1 


4' 


Si u 


0 




Sid 


0 



40 

The present invention is also applicable to cases similar to Tables 6 and 7 where the teacher data 
changes in more complex and more frequent manner. 

Another embodiment which is applicable when a correct solution is known as the output for the 
diagnosis data is now explained with reference to a flow chart of Fig. 26. 
45 When the calculation is started, learning data, teacher data therefor and a number of values which are 
candidates for the weights (coupling weights) K 4 are read as learning data (1) from a memory, a hard disk 
(HD) or a flexible disk (FD) of a computer. 

Then, whether data to be added as the leaning data (2) is present or not is determined. Since there is 
no additional data initially (TUIKA$ = NASHl$), the calculation of a total sum is initiated. Total sums of inputs 
so weighted by appropriate weights (coupling factors) K, inthe manner described above are determined and 
they are arranged together with the corresponding teacher data in the order of magnitude of the input total 
sum, and the number of times of change of the teacher data is considered. An output layer comprising the 
number of neurons of the first and second hidden layers determined by the number of times of change of 
the teacher data plus one neuron is prepared and the coupling weights are determined in the manner 
55 described above. In this step, the step corresponding to the learning in the prior art reverse propagation 
learning method is completed. 

Then, a screen of a CRT (cathode ray tube) inquires the need for diagnosis. If Y (YES) is entered and 
the diagnosis data is inputted, the corresponding values are read from the memory, hard disk (HD) or 
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flexible disk (FD) of the computer. Then, they are diagnosed and the result is displayed. Then, if Y is 
entered to the inquiry "Correct solution known?" on the CRT, the characters TUIKAS are changed to ARI$ 
and the correct solution as well as the data inputted for the diagnosis are saved in the HD or FD as the 
learning data (2). Then, if Y is entered to the inquiry "Terminate?" on the CRT screen, the diagnosis is 
5 terminated. 

While not shown, when the configuration of the neural network and the coupling weights have been 
determined, various values including the coupling weights may be saved in the HD or FD and necessary 
calculation loop may be modified for use in the next diagnosis to improve the efficiency. 

When the calculation is resumed in the next run, the calculation is executed in the substantially same 
io manner. As to the data diagnosed in the previous run and for which the correct solution has been inputted, 
it is added to the input data in the step of the learning data (2) so that the learning is done better and the 
correct solution is naturally given to the same inquiry as that of the diagnosis data correctly solved in the 
previous run. In this manner, the precision of diagnosis is enhanced gradually. In the present system in 
which the configuration of the neural network and the coupling weights are analytically determined, the 
15 diagnosis may be resumed with a higher precision in several to several tens seconds after the addition of 
the learning data, although it depends on the scale of the number of data. 

Table 8 show various propositions used in general inference. 

Table 8 

20 

(1) If A, then B 

(2) If not B, then not A 

(3) If B, then A 

25 For the proposition (1 ) of Table 8, the proposition (2) of Table 8 is referred to as a contraposition which 
is a proposition which uses the negation of a conclusion for one proposition as a hypothesis and the 
negation of the hypothesis as a conclusion. The true and false of the original proposition and the 
contraposition always match. On the other hand, the proposition (3) of Table 8 is referred to as a converse 
to (1). The converse is a proposition derived by inverting the hypothesis and the conclusion of one 

30 proposition. Even if the original proposition is true, the converse is not always true. 

In the prior art neural network, which of the input learning data the diagnosis data is similar to is 
determined depending on which of the previously learnt output (teacher) data the output data at the time of 
input of the diagnosis data is similar to. As seen from the general description of Table 8, when (1) of Table 
8 is "If the learning data ia A, then the output data is B", the use of (2) of Table 8 is always true but the use 

35 of (3) of Table 8 is not always true, in a strict sense. (3) of table 8 includes those whose conclusions are A if 
the hypothesis is B and those whose conclusions are not B. However, in the neural network, there is a case 
in which a conclusion which is not A but similar to A may be treated as A without problem. 

An example is given. Fig. 27 shows input data of a neural network in one case. An abscissa represents 
a date number n and an ordinate represents input data x. The learning of data marked with circle in Fig. 27 

40 corresponds to A of Table 8 (1) and the output (teacher) data therefor corresponds to B of Table 8 (1). It is 
assumed that the diagnosis data are mostly equal to the data marked with circle but only data numbers i 
and k are different. In Fig. 27, the data which coincide with the data marked with circle are omitted for the 
expression of mark X. In the diagnosis data, the i-th and k-th data are represented by (Xj + Ax t ) and 
(x k + AxO, and the weights (coupling weights) for the input data are represented by Ki and A difference 

45 AS between total sums of the two inputs weighted by the weights (coupling weights) is given by: 

AS = Ax l *K i + Ax k -K k (20) 

If K t and K k have been set to appropriate values for the learning data marked with circle of Fig. 27, it is 
50 easy to give one of Axi and Ax k to determine the other so that AS becomes zero. In this case, even if the 
learning has been made by using only the learning data marked with circle and even if the data which is 
partially out of the range of the circle marks of Fig. 27 is diagnosed, the total sums weighted by the weights 
(coupling weights) are equal for the case of learning and the case of diagnosis, and hence the output data 
are equal for both cases. Namely, even if the output data are equal, the input data are not always equal. 
55 This correspond to "the converse is not always true" as described in conjunction with Table 8. In this case, 
some are inferred from the output data as anticipated and the other are non-anticipated ones. However, in 
the neural network, even if unlearned data is inputted, it may be regarded to be almost same as the learnt 
data if the output is very similar to that for the learnt data. In odder to further enhance the precision of 
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15 



20 



diagnosis, however, it is necessary to distinguish it from the output resulting ,n in ? ut da,a - 

method for positively eliminating the nonanticipated data is explamed with reference to F.g. 27. 

Te datVfor the diagnosis Ts inputted to calculate the output and most probable learning data s mferred 
fcomTn^ol^SSi .t is assumed that the data inferred to be most probable is the data marked 
a wrth circle* Fig 27. Then, whether the data marked with circ.e in Fig. 27 match to the ^diagnosis date o 
not is determined for each data number. If the learning data and the diagnosis data match each other for aH 
rtne^S daS iHs determined that the diagnosis data is equal to one for the selected learning case^H a 
plrfon of le input data does not match, it is determined that the diagnosis data .s different from one for 
the selected leaning case. In this manner, the precision of diagnosis is enhanced. 
t0 S is deterged by the above diagnosis that the diagnosis data and the learning data do not 
completely match No match with leaning data" may be indicated as well as "S.milar cases j* , a 
ofle number of matching data to the total number of inputs per case may be indicated. Th.s 
will give more kind reply to a user than simply indicating "No match w.th learning data . 

in the above example, only two data do not match to the data marked w.th cirde of F,g. 27. When more 
data do not match to the data marked with circle, a similar method may be applied. 

The ^malion of the diagnosis when the scatter of the learning data is taken into cons.derafion s 
now exp.red^th reference to Fig. 28. In Fig. 28. an abscissa represents the data number and an ordinate 
epresem tS Input data. A curve F u shows upper limit input data of a curve F in one case and a curve F, 
shows lower limit data of the same case. It is assumed that the curves F u and F d o F.g.28 have i been 
sefectedT the most probable case after the input of the diagnosis data and the calculation of the output 
Se L the case of Fig. 27. the fol.owing calculation for confirmation is made. Where n, is the number of 

m %XoZr ?£ number r* of diagnosis data (marked with X) which are on or below the curve F M . and 
(2) determine the number n 3 of diagnosis data which are on or above the cun/e F, 
If n, n 2 and n 3 are all equal, it is determined that the diagnosis data are in the range of the curves F u 

determined that a portion of the diagnosis data is out 
of tne^?oTsJL7e^rc.ed by the curves F u and F d . In such a case, it is indicated that the diagnosis 
a at do not completely match to the learning data as well as the percentage of «te d,agnos,s date Ao he 
input data which are between the curves F u and F d are indicated as explained ai HQ. 27 so that the ^degree 
Srniladty to the learning data is indicated for the convenience of the user. In th.s manner, exact d.agnos.s 
may be attained even for the input data having the scatter. f „,, nwinn one 

A specific method for comparing the diagnosis data and the learning data may be the following one 
instead of the steps (a) and (b) described above. 

(a 1 ) Determine the number of diagnosis data which are above the curve F u o Fig. 28. 
(b') Determine the number of diagnosis data which are below the curve F d of F.g. 28. 
If both of the numbers of data determined in the steps (a') and <b') are zero, it .s determined that the 
diaonosis data is within the range of scatter of the learnt data for a case. 

T Fig 28 he upper limit F u and the lower limit F d of the range of scatter may be set somewhat larger 
than an actual scatter which may be experimentarily determined in order to prepare for any future possible 
X£«^t£^£ some reason, .f the diagnosis data does not completely match to the learning 
date, not on* the number of data which are out of the range of scatter but also the degree of deviation may 
be included to affect to the indication. 

The methods described in Figs. 27 and 28 may be applied to the following cases. 
as (1 ) Application to a reverse learning method of a multi-layer neural network 

S Application to the present invention to analytically determine the configuration of the multilayer neura. 

7SS££Z£ neJfnetork but repeat the confirmation as described above as to the similarity 

. v^stjz star.ssr-^. - * - rT 

numbers may be compared by prepared software, or a diagnosis data group and a learning data Qoupofa 
case^ay be co!le?tive.y compared by a matrix or a matrix formula to indicate a group having a va.ue equal 
to or smaller than a value of other group, or count the number thereof. 

ThT above cLs (1) to (3) are explained in the reverse order of (3) to (1) for the convenience of 
ss exolLation "n tS^eiod (3 . to which learning case the diagnosis data is most similar (including totally 
case) is determined fo the input data of all cases. Accordingly, where the number of cases is .arge 
an ^ ttte number of Za per case is large, some time is required for the calculation for the diagnosis 
Holever in the prior art reverse propagation learning method, where the learning takes too much time or 
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the learning does not proceed because of local minimum, this method may be practiced if a high speed 
operation is not absolutely required. Since the correctness of diagnosis rather than the calculation speed is 
frequently required in the fault diagnosis, this method may be practical in many cases. 

In the method (2), the diagnosis data is inputted, the output is calculated, the similar learning case is 
5 selected and the confirmation calculation is made in the manner described above. Since the number of 
times of calculation is smaller than that of the method (2), the calculation time is shorter and the diagnosis 
is more correct. 

In the method (1), the confirmation calculation shown in Figs. 27 and 28 is required in the worst case 
(where the configuration of the neural network is very simple, for example, the number of hidden layer is 

io one and the number of neuron of the hidden layer is one). However, where the neural network is more 
complex, the confirmation calculation is possibly not required. This is explained below assuming that the 
number of hidden layer is one. 

Where the number of neurons of the first hidden layer is more than one, the coupling weights between 
the neurons of the input layer and the first neuron of the first hidden layer are generally different from the 

T5 coupling weights between the input layer and the second neuron of the first hidden layer. Accordingly, even 
if AS given by the formula (20) is zero with respect to the coupling weights between the input layer and the 
first neuron of the first hidden layer, AS 1 calculated in the same manner as that for AS with respect to the 
coupling weights between the input layer and the first hidden layer is highly possibly not zero. As a result, 
the output from the second neuron of the first hidden layer is highly possibly different between the learning 

20 data and the diagnosis data. This affect generally also appears in the output of the output layer. As a result, 
the learning data marked with circle shown in Fig. 27 and the diagnosis data (data which coincide with the 
circles are not shown) are highly probably discriminated. Where the configuration of the neural network is 
further complex, this trend is more prominent and the confirmation calculation described in connection with 
Figs. 27 and 28 is not required in many cases. 

25 In the analytical method of the present invention, it is naturally desired to eliminate the confirmation 
calculation in the diagnosis as much as possible. This may be attained to some extent where there are a 
plurality of outputs. This is explained in connection with the network of Fig. 25 and a network of Fig. 29 
having a similar configuration. As to the first output, the configuration of the neural network and the coupling 
weights are determined by the method explained in connection with Fig. 25. Initial weights (coupling 

30 weights) for the input are shown by Kj. When the second output of Fig. 29 is to be determined, the initial 
weights (coupling weights) for the input are set differently from those for the first output within the range 
which meets the respective requirements explained in connection with Fig. 25, and they are shown by Kj\ 
Where other outputs are present, the initial coupling weights for the input are set differently from those of 
the first and second output calculations in a similar manner. Thus, even if a difference between the learning 

35 data and the diagnosis data does not appear in the first output shown in Fig. 25, a difference may highly 
probably appear in the second output shown in Fig. 29 because the weights (coupling weights) to the input 
are changed. As a result, as a whole output data, the possibility of detection of difference between the 
learning data and the diagnosis data explained in Fig. 27 is increased. 

In accordance with the present method, since the number of hidden layers and the number of neurons 

40 are known and the number of coupling weights to be learnt is significantly decreased, the leaning time can 
be reduced. Where a program of the reverse propagation learning is already present, the modification of the 
program is easy and various functions of the existing program can be utilized while the advantages of the 
present invention is reserved. 

By the methods described above, the equivalent status to that attained by the conventional iterative 

45 learning is attained in an efficient manner. Accordingly, by using the input data used for the calculation, the 
calculation output which is totally equal to or very close to the teacher data can be outputted. If the number 
of cases which are handled in this manner is set large, an output which is approximated to a similar input 
case can be outputted even for a totally new case. Accordingly, the present embodiment is applicable to 
various recognitions and diagnosis. 

50 A comparison between the calculation by the present invention and the calculation by the conventional 
back propagation learning will be explained. The comparison was made by measuring corona noises of GIS 
in a known method, frequency analyzing the data stored in a flexible disk and reading representative data. 

In the conventional back propagation learning method, the neural network shown in Table 9 was used. 
In order to determine a structure of a sample GIS, the number of outputs from the output layer was set to 3. 

55 It was intended to determine up to 8 ( = 2 x 2 x 2) types. 
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Table 9 



Number of input data per case 


10 


Number of outputs of output layer 


3 


Number of hidden layers 


1 


Number of neurons of hidden layer 


5 


Number of cases 


14 



Three preferable outputs (teacher data) for each case are designated as shown in Table 10. 

Table 10 



Case Number 


Teacher Data 


Output 1 


Output 2 


Output 3 


1 


0 


0 


0 


2 


0 


0 


0 


3 


1 


0 


0 


4 


0 


1 


0 


5 


0 


0 


1 


6 


0 


0 


1 


7 


1 


1 


0 


8 


1 


1 


0 


9 


1 


0 


1 


10 


1 


0 


0 


11 


0 


1 


1 


12 


0 


1 


1 


13 


1 


0 


1 


14 


1 


0 


1 



Outputs calculated by the conventional back propagation learning method are shown in Tables 1 1 and 
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Table 1 1 



w 



15 





After 400 times of learning 


After 800 times of learning 


uuipui 1 


f*^> lini ■+ O 

UUtpUT <L 


UUtpUI o 


vjuxpui 1 


UUipUT c. 


L/UipUt O 


1 


u.uuu 


A ceo 


U.oyo 


n AAA 

u.uuu 


A A7fl 
U.4/0 


A CAA 
U.DU4 


2 


A ClAO 

U.U4<£ 


U.4ol 


A OOO 


A AAA 

U.UUU 


A 10Q 

U. loo 


A cno 
U.OUO 


o 


0.922 


0.01 o 


0.575 


1 .uuo 


A A -i C 
U.I 1 D 


U.lbU 


A 


U.UUU 


O.ybb 


U.54o 


A AAA 

u.uuu 


-t AAA 
1 .UUU 


A OC7 


c 
O 


0.071 


0.163 


0.461 


0.000 


0.000 


1.000 


D 


0.098 


0.166 


1.000 


0.002 


0.000 


1.000 


7 


0.957 


0.208 


0.486 


1.000 


0.956 


0.000 


8 


0.951 


1.000 


0.272 


1.000 


1.000 


0.000 


9 


1.000 


0.200 


0.394 


0.965 


0.000 


0.980 


10 


0.975 


0.131 


0.452 


1.000 


0.046 


0.136 


11 


0.000 


0.981 


1.000 


0.000 


1.000 


1.000 


12 


0.000 


0.552 


! 0.398 


0.000 


0.507 


0.465 


13 


0.996 


0.020 


0.972 


0.973 


0.009 


1.000 


14 


0.970 


0.031 


0.964 


0.980 


0.019 


1.000 


Calculation Time 


1 hour 47 minutes 


3 hours 34 minutes 


Total Error 


1.259152 


0.456008 
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Table 12 



w 



15 



20 



25 



30 



Case Number 


After 1 200 times of learning 


Ontnirt 1 




Oi itrti it *\ 


1 


n nnn 




n Rfifi 

u.ooo 


2 


n nnn 


n P4A 


n 1 


3 


1 nnn 


n n^n 

U.UUU 


n n97 


4 


n nnn 
u.uuu 


1 nnn 


n HfiQ 

u.uoy 


5 


0.000 


0.000 


1.000 


6 


0.022 


0.000 


1.000 


7 


1.000 


0.998 


0.000 


8 


1.000 


1.000 


0.000 


9 


0.990 


0.000 


0.994 


10 


1.000 


0.000 


0.038 


11 


0.000 


1.000 


1.000 


12 


0.000 


0.601 


0.537 


13 


0.984 


0.000 


1.000 


14 


0.989 


0.005 


1.000 


Calculation Time 


5 hours 21 minutes 


Total Error 


0.360491 ! 



40 



45 



In counting the number of times of learning, when the learning is completed one time for each case and 
all cases have been completed, it is counted as one time of learning. The numbers of times of learning are 
400 and 800 in Table 1 1 , and 1 200 in Table 1 2. It is ideal that the calculation output completely matches to 
Table 10 but it does not completely match in the present example. Where "0" is expected as the output, 
the number no smaller than 0.5 is not permissible. Similarly, where "1" is expected as the output, the 
number smaller than 0.5 is not permissible. The impermissible numbers are shown with underscore. As the 
number of times of learning increases from 400 to 800 and 1200, the number of underscored numbers 
decreases from 7 to 3 and 2. This shows that the calculation accuracy is enhanced. The times required for 
the calculation and the sums of the errors of the respective cases at the end of the calculation are also 
shown. In the present example, it is seen that five hours and twenty minutes are required for the 1200 times 
of learning and the decrease of the error is slow when the number of time of learning exceeds a certain 
number. The calculation of the back propagation learning is proceeded while the number of times of 
learning and the total error are displayed on the CRT. The total error does not decrease monotonously 
relative to the number of times of learning and it suddenly increases after decrease but gradually decreases 
as a general trend. In general, Table 12 is still insufficient for the required calculation precision and further 
learning is required. 

Calculation result by the method of Fig. 2 of the present invention by using the same input data is 
shown in Table 13. 
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Table 13 



Cass Number 


Calculation Output 


Ontnut 1 


Ontnut P 


Ontnut *\ 


1 




o 


o 


2 


o 


o 


o 


3 


j 


o 


o 


4 


u 


-J 


n 


5 


0 


0 


1 


Q 


0 


0 


1 


7 


1 


1 


0 


8 


1 


1 


0 


9 


1 


0 


1 


10 


1 


0 


0 


11 


0 


1 


1 


12 


0 


1 


1 


13 


1 


0 


1 


14 


1 


0 


1 


Calculation Time 


0.84 second 


Total Error 


0.000 



As seen from Table 13, the calculation output rounded at fourth digit below a decimal point completely 
matches to the teacher data shown in Table 10. The time required for the calculation of the coupling 
weights is 0.84 second. 

In the method of the present invention, since the analytical calculation is made and the iterative 
calculation to converge the number is not included, the calculation time is very short. 
The present embodiment offers the following advantages: 

(1) The number of hidden layers of the neural network and the number of neurons thereof can be 
determined base on the input data and the teacher data, and the coupling weights can be analytically 
calculated. Accordingly, the prior art reverse propagation iterative learning which requires a long time is 
not necessary and the time from the data input to the recognition and diagnosis can be reduced. 

(2) Some time of trial and error required in the prior art method to select the number of hidden layers 
and the number of neurons thereof is not necessary. Accordingly, the calculation can be proceeded 
efficiently. 

(3) It is applicable to a large scale neural network having a large number of input data and teacher data. 

(4) Where the input data is added together with the teacher data, whether it is worthwhile for adding can 
be determined, and if it is determined worthless, the addition can be cancelled to save the memory 
space of the computer. 

(5) Since the iterative learning is not necessary, the calculation of the coupling weights between neuron 
to the diagnosis can be done by more economic computer such as a notebook personal computer. 

(6) Where the teaching of the present invention is applied to the neural network which requires the 
conventional back propagation method, the time required for the learning is significantly reduced 
because the number of hidden layers and the number of neurons are known, the relation between the 
coupling weights are known and the number of coupling weights which actually require the learning is 
very small, and the functions of the conventional program can be utilized as they are. 

(7) Where the number of case of input data is large, they are classified into a plurality of classes based 
on some features of the cases and the learning and diagnosis are made for each class. Accordingly, the 
precision of diagnosis is enhanced even for similar or confusing cases. 



22 




EP 0 616 291 A2 



(8) A correct solution is attained for the input data having a scatter if the scatter is within the 
predetermined range of scatter. 

(9) A good method for selecting the coupling weights of the initial stage to classify the input data is 
provided and a selection efficiency is enhanced. 

s (10) In classifying the input data, a portion of data is nit used directly but reciprocals thereof are used so 
that the input data can be efficiently classified. 

(11) For the unlearned data, it may be inferred from the relation with adjacent data as it is done in the 
prior art, or the fact of not leant may be indicated by an appropriate expression such as "?" or a 
deviation from the learnt data may be indicated so that more correct diagnosis is attained. 
io (1 2) Where there is teacher data which is a correct solution to the diagnosed input data, the diagnosed 
input data and the correct solution teacher data are efficiently added to the learning data. Accordingly, 
the precision of diagnosis is enhanced. 

(13) The diagnosis data is inputted, the output is calculated and the most similar learning case is 
precisely inferred based on the output in a logically correct manner. 

75 While the above description is primarily based on a method, the present invention may be implemented 
by hardware having a similar function. The fields of application of the present invention are not only 
engineering fields but also biotechnology and medical fields as well as the recognition and diagnosis of 
physical, chemical and biological phenomena and the recognition and diagnosis in agricultural, educational, 
economical and other various fields. 

20 In accordance with the present invention, the coupling weights can be precisely and readily determined 
in a system such as the neural network, and the number of hidden layers and the number of neurons can 
be readily determined. 

Claims 

25 

1. A recognition or diagnosis method using a neural network having an input layer, at least a hidden layer 
and an output layer comprising the steps of: 

classifying learning data inputted to said input layer based on features thereof; 
inputting learning data for each class; 
30 calculating a total sum for each case; 

correlating corresponding teacher data to the total sum in the order of magnitude for each output of 
said output layer; and 

determining coupling weights based on the teacher data corresponding to a case having a 
maximum total sum, total sums before and after the change of the teacher data and the teacher data. 

35 

2. A recognition or diagnosis method using a neural network having an input layer, at least a hidden layer 
and an output layer comprising the steps of: 

inputting input data for each value for each case based on an upper limit and a lower limit of the 
input data of each case inputted to said input layer; 
40 calculating a total sum for each case; 

correlating corresponding teacher data to the total sum in the order of magnitude for each output of 
said output layer; and 

determining coupling weights based on the teacher data corresponding to a case having a 
maximum total sum, total sums before and after the change of the teacher data and the teacher data. 

45 

3. A recognition or diagnosis method according to Claim 2 wherein the input data inputted to said input 
layer are products of the upper limit and the lower limit multiplied by the same coupling weight. 

4. A recognition or diagnosis method using a neural network having an input layer, at least a hidden layer 
so and an output layer comprising the steps of: 

selecting coupling weights such that an input sum determined for a case other than a particular 
case inputted to said input layer does not fall between two input sums determined based on an upper 
limit and a lower limit of input data of said particular case and determining the input sum; 

inputting said input sum as input data; 
55 calculating an input sum for each case; 

calculating a total sum for each case; 

correlating corresponding teacher data to the input sum in the order of magnitude for each output 
of said output layer; and 
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determining coupling weights based on the teacher data corresponding to a case having a 
maximum total sum, total sums before and after the change of the teacher data and the teacher data. 

5. A recognition or diagnosis method according to Claim 4 wherein the teacher data of said particular case 
5 and the teacher data of a case other than said particular case are different from each other. 

6. A recognition or diagnosis method according to Claim 1 wherein two hidden layers are provided and 
the coupling weights are selected such that when the teacher data corresponding to the case having 
the maximum total sum is 0, the second hidden layer functions as an AND circuit and said output layer 

?o functions as an OR circuit. 

7. A recognition or diagnosis method according to Claim 1 wherein two hidden layers are provided and 
the coupling weights are selected such that when the teacher data corresponding to the case having, 
the maximum total sum is 1, the second hidden layer functions as an OR circuit and said output layer 

75 functions as an AND circuit. 

8. A recognition or diagnosis method according to Claim 1 wherein said data are used by reciprocals 
thereof for each case and a reversal operation in the magnitude of the input data is executed. 

20 9. A recognition or diagnosis method according to Claim 1 wherein a back propagation iterative learning 
method is used to calculate the coupling weights which are principal parts of the neural network. 

10. A recognition or diagnosis method using a neural network having an input layer, at least a hidden layer 
and an output layer comprising the steps of: 
25 inputting learning data to said input layer for each case; 

calculating a total sum for each case; 

correlating corresponding teacher data to the total sum in the order of magnitude for each output of 
said output layer; 

determining coupling weights based on the teacher data corresponding to a case having a 
30 maximum total sum, total sums before and after the change of the teacher data and the teacher data; 
and 

indicating the deviation from a specified range of scatter of the learning data when input diagnosis 
data is out of the range. 

35 11. A recognition or diagnosis method using a neural network having an input layer, at least a hidden layer 
and an output layer comprising the steps of: 

inputting learning data to said input layer for each case; 
calculating a total sum for each case; 

correlating corresponding teacher data to the total sum in the order of magnitude for each output of 
40 said output layer; 

determining coupling weights based on the teacher data corresponding to a case having a 
maximum total sum, total sums before and after the change of the teacher data and the teacher data; 
and 

transferring diagnosis data and the corresponding correct solution as a portion of the learning data 
45 after the diagnosis when the correct solution to the diagnosis data is known in inputting the diagnosis 
data. 

12. A recognition or diagnosis method using a neural network having an input layer, at least a hidden layer 
and an output layer comprising the steps of: 
50 inputting learning data to said input layer for each case; 

calculating a total sum for each case; 

correlating corresponding teacher data to the total sum in the order of magnitude for each output of 
said output layer; 

determining coupling weights based on the teacher data corresponding to a case having a 
55 maximum total sum, total sums before and after the change of the teacher data and the teacher data; 
and 

calculating an output for diagnosis data when the diagnosis data is inputted; 
inferring at least one probable learning data from the output; 
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comparing the diagnosis data with the learning data; and 
indicating the result thereof. 

13. A recognition or diagnosis method according to Claim 12 wherein said inferred learning data includes a 
5 scatter and the diagnosis data is compared with the learning data having the scatter and the result 

thereof is indicated. 

14. A recognition or diagnosis method according to Claim 1 wherein a first coupling weight to be multiplied 
to the input data of said input layer is selected to be different from other coupling weights. 

10 

15. A recognition or diagnosis method comprising the steps of: 

comparing diagnosis data directly with learning data; 

determining whether the diagnosis data and the learning data of a particular case totally match, 
partially match or totally mismatch; and 
75 indicating the result thereof. 

16. A recognition or diagnosis method comprising the steps of: 

comparing diagnosis data directly with learning data having a range of scatter; 
determining whether the diagnosis data is totally within said range, partially within said range or 
20 totally out of said range; and 

indicating the result thereof. 

17. A recognition or diagnosis method comprising the steps of: 

preparing simultaneous equations by using calculation formulas of inputs to a neuron based on an 
25 upper limit and a lower limit of the input data having a range of scatter; 

solving the simultaneous equations to determine coupling weights; and 
utilizing a neural network based on the coupling weights. 

18. A recognition or diagnosis method by using a neural network having an input layer, at least a hidden 
30 layer and an output layer weighted by coupling weights comprising the step of: 

self -evaluating teacher data for determining the coupling weights. 

19. An apparatus for diagnosing an electrical circuit to be used under a plurality of conditions by using a 
neural network comprising means for taking the plurality of conditions as learning data to determine 

35 coupling weights which form said neural network. 
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